AITopics | mm algorithm

Collaborating Authors

mm algorithm

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

15cf76466b97264765356fcc56d801d1-Paper.pdf

Neural Information Processing SystemsApr-24-2026, 20:32:44 GMT

data mining, hawke process, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States (0.95)

Industry:

Information Technology (1.00)
Energy (1.00)
Health & Medicine (0.93)
Government > Regional Government > North America Government > United States Government (0.69)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.68)

Add feedback

c3c617a9b80b3ae1ebd868b0017cc349-Supplemental.pdf

Neural Information Processing SystemsFeb-11-2026, 01:33:05 GMT

algorithm, divergence, uot, (15 more...)

Neural Information Processing Systems

Country:

Europe > France > Normandy > Seine-Maritime > Rouen (0.04)
Europe > France > Occitanie > Haute-Garonne > Toulouse (0.04)
Europe > France > Provence-Alpes-Côte d'Azur (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.41)

Add feedback

c3c617a9b80b3ae1ebd868b0017cc349-Paper.pdf

Neural Information Processing SystemsFeb-11-2026, 01:33:01 GMT

algorithm, divergence, uot, (13 more...)

Neural Information Processing Systems

Country:

Europe > France > Normandy > Seine-Maritime > Rouen (0.04)
Europe > France > Occitanie > Haute-Garonne > Toulouse (0.04)
Europe > France > Provence-Alpes-Côte d'Azur (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)

Add feedback

Revisiting Incremental Stochastic Majorization-Minimization Algorithms with Applications to Mixture of Experts

Tran, TrungKhang, Nguyen, TrungTin, Fort, Gersende, Doan, Tung, Nguyen, Hien Duy, Nguyen, Binh T., Forbes, Florence, Drovandi, Christopher

arXiv.org Machine LearningJan-28-2026

Processing high-volume, streaming data is increasingly common in modern statistics and machine learning, where batch-mode algorithms are often impractical because they require repeated passes over the full dataset. This has motivated incremental stochastic estimation methods, including the incremental stochastic Expectation-Maximization (EM) algorithm formulated via stochastic approximation. In this work, we revisit and analyze an incremental stochastic variant of the Majorization-Minimization (MM) algorithm, which generalizes incremental stochastic EM as a special case. Our approach relaxes key EM requirements, such as explicit latent-variable representations, enabling broader applicability and greater algorithmic flexibility. We establish theoretical guarantees for the incremental stochastic MM algorithm, proving consistency in the sense that the iterates converge to a stationary point characterized by a vanishing gradient of the objective. We demonstrate these advantages on a softmax-gated mixture of experts (MoE) regression problem, for which no stochastic EM algorithm is available. Empirically, our method consistently outperforms widely used stochastic optimizers, including stochastic gradient descent, root mean square propagation, adaptive moment estimation, and second-order clipped stochastic optimization. These results support the development of new incremental stochastic algorithms, given the central role of softmax-gated MoE architectures in contemporary deep neural networks for heterogeneous data modeling. Beyond synthetic experiments, we also validate practical effectiveness on two real-world datasets, including a bioinformatics study of dent maize genotypes under drought stress that integrates high-dimensional proteomics with ecophysiological traits, where incremental stochastic MM yields stable gains in predictive performance.

artificial intelligence, citedonpage4, machine learning, (20 more...)

arXiv.org Machine Learning

2601.19811

Country:

Asia (1.00)
Europe > France (0.28)

Genre: Research Report > New Finding (0.46)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

Add feedback

Generalized Linear Model Regression under Distance-to-set Penalties

Jason Xu, Eric Chi, Kenneth Lange

Neural Information Processing SystemsNov-21-2025, 04:04:29 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, regression, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > North Carolina (0.04)
North America > United States > New York (0.04)
(3 more...)

Genre: Research Report > New Finding (0.69)

Industry: Health & Medicine (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.70)

Add feedback

Unbalanced Optimal Transport through Non-negative Penalized Linear Regression

Neural Information Processing SystemsAug-17-2025, 06:19:42 GMT

This paper addresses the problem of Unbalanced Optimal Transport (UOT) in which the marginal conditions are relaxed (using weighted penalties in lieu of equality) and no additional regularization is enforced on the OT plan.

algorithm, optimal transport, uot, (15 more...)

Neural Information Processing Systems

Country:

Europe > France > Normandy > Seine-Maritime > Rouen (0.04)
Europe > France > Occitanie > Haute-Garonne > Toulouse (0.04)
Europe > France > Provence-Alpes-Côte d'Azur (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.41)

Add feedback

Unbalanced Optimal Transport through Non-negative Penalized Linear Regression

Neural Information Processing SystemsAug-17-2025, 06:19:38 GMT

algorithm, artificial intelligence, machine learning, (16 more...)

Neural Information Processing Systems

Country:

Europe > France > Normandy > Seine-Maritime > Rouen (0.04)
Europe > France > Occitanie > Haute-Garonne > Toulouse (0.04)
Europe > France > Provence-Alpes-Côte d'Azur (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.41)

Add feedback

Generalized Linear Model Regression under Distance-to-set Penalties

Jason Xu, Eric Chi, Kenneth Lange

Neural Information Processing SystemsOct-2-2024, 15:57:49 GMT

Estimation in generalized linear models (GLM) is complicated by the presence of constraints. One can handle constraints by maximizing a penalized log-likelihood. Penalties such as the lasso are effective in high dimensions, but often lead to unwanted shrinkage. This paper explores instead penalizing the squared distance to constraint sets. Distance penalties are more flexible than algebraic and regularization penalties, and avoid the drawback of shrinkage. To optimize distance penalized objectives, we make use of the majorization-minimization principle. Resulting algorithms constructed within this framework are amenable to acceleration and come with global convergence guarantees. Applications to shape constraints, sparse regression, and rank-restricted matrix regression on synthetic and real data showcase strong empirical performance, even under non-convex constraints.

algorithm, constraint, regression, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > North Carolina (0.04)
North America > United States > New York (0.04)
(3 more...)

Genre: Research Report > New Finding (0.69)

Industry: Health & Medicine (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.70)

Add feedback

BadMerging: Backdoor Attacks Against Model Merging

Zhang, Jinghuai, Chi, Jianfeng, Li, Zheng, Cai, Kunlin, Zhang, Yang, Tian, Yuan

arXiv.org Artificial IntelligenceSep-2-2024

Fine-tuning pre-trained models for downstream tasks has led to a proliferation of open-sourced task-specific models. Recently, Model Merging (MM) has emerged as an effective approach to facilitate knowledge transfer among these independently fine-tuned models. MM directly combines multiple fine-tuned task-specific models into a merged model without additional training, and the resulting model shows enhanced capabilities in multiple tasks. Although MM provides great utility, it may come with security risks because an adversary can exploit MM to affect multiple downstream tasks. However, the security risks of MM have barely been studied. In this paper, we first find that MM, as a new learning paradigm, introduces unique challenges for existing backdoor attacks due to the merging process. To address these challenges, we introduce BadMerging, the first backdoor attack specifically designed for MM. Notably, BadMerging allows an adversary to compromise the entire merged model by contributing as few as one backdoored task-specific model. BadMerging comprises a two-stage attack mechanism and a novel feature-interpolation-based loss to enhance the robustness of embedded backdoors against the changes of different merging parameters. Considering that a merged model may incorporate tasks from different domains, BadMerging can jointly compromise the tasks provided by the adversary (on-task attack) and other contributors (off-task attack) and solve the corresponding unique challenges with novel attack designs. Extensive experiments show that BadMerging achieves remarkable attacks against various MM algorithms. Our ablation study demonstrates that the proposed attack designs can progressively contribute to the attack performance. Finally, we show that prior defense mechanisms fail to defend against our attacks, highlighting the need for more advanced defense.

adversary, badmerging, target class, (14 more...)

arXiv.org Artificial Intelligence

2408.07362

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
Asia > Nepal (0.04)

Genre: Research Report > New Finding (0.93)

Industry:

Information Technology > Security & Privacy (1.00)
Transportation > Ground > Road (0.67)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Add feedback

Preserving the Privacy of Reward Functions in MDPs through Deception

Chirra, Shashank Reddy, Varakantham, Pradeep, Paruchuri, Praveen

arXiv.org Artificial IntelligenceJul-13-2024

Preserving the privacy of preferences (or rewards) of a sequential decision-making agent when decisions are observable is crucial in many physical and cybersecurity domains. For instance, in wildlife monitoring, agents must allocate patrolling resources without revealing animal locations to poachers. This paper addresses privacy preservation in planning over a sequence of actions in MDPs, where the reward function represents the preference structure to be protected. Observers can use Inverse RL (IRL) to learn these preferences, making this a challenging task. Current research on differential privacy in reward functions fails to ensure guarantee on the minimum expected reward and offers theoretical guarantees that are inadequate against IRL-based observers. To bridge this gap, we propose a novel approach rooted in the theory of deception. Deception includes two models: dissimulation (hiding the truth) and simulation (showing the wrong). Our first contribution theoretically demonstrates significant privacy leaks in existing dissimulation-based methods. Our second contribution is a novel RL-based planning algorithm that uses simulation to effectively address these privacy concerns while ensuring a guarantee on the expected reward. Experiments on multiple benchmark problems show that our approach outperforms previous methods in preserving reward function privacy.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

arXiv.org Artificial Intelligence

2407.09809

Country: